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CHARACTERIZATION OF A CLASS OF WEAK 
TRANSPORT-ENTROPY INEQUALITIES ON THE LINE 

NATHAEL GOZLAN, CYRIL ROBERTO, PAUL-MARIE SAMSON, YAN SHU, PRASAD TETALI 


Abstract. We study an optimal weak transport cost related to the notion of convex 
order between probability measures. On the real line, we show that this weak transport 
cost is reached for a coupling that does not depend on the underlying cost function. As 
an application, we give a necessary and sufficient condition for weak transport-entropy 
inequalities (related to concentration of convex/concave functions) to hold on the line. 
In particular, we obtain a weak transport-entropy form of the convex Poincare inequality 
in dimension one. 


1. Introduction 

The aim of this paper is to study a weak transport cost and its associated weak 
transport-entropy inequality, both introduced by four of the authors in [22], on the real 
line. In order to present our results, we shall first introduce the various mathematical ob¬ 
jects of interest to us, placing and motivating their significance within the classical theory 
of optimal transport and its connection with the concentration of measure phenomenon. 

Throughout the paper, V(R) denotes the set of Borel probability measures on M and 
7^1 (M) := {fj, G V{U) : \x\^{dx) < oo}, the subset of probability measures having a finite 

first moment. 

Let 0 : M'*' —)• M'*', with 0(0) = 0, be a measurable function referred to as the cost 
function. Then, the usual optimal transport cost, in the sense of Kantorovich, between 
two probability measures p. and u on M is defined by 

(1) 7^(u,/r) := inf JJ 9{\x - y\) 7r{dxdy), 

where the infimum runs over the set of couplings vr between p and u, i.e., probability 
measures on such that 7r{dx x M) = p{dx) and 7r(M x dy) = v^dy). 

Since the works by Marton [28, 29, 30] and Talagrand [36], these transport costs have 
been extensively used as a tool to reach concentration properties for measures on product 
spaces. More precisely, optimal transport is related to the concentration of measure phe¬ 
nomenon via the so-called transport-entropy inequalities that we now recall. A probability 
measure /x on M is said to satisfy the transport-entropy inequality T(0), if for all u £ V(R), 
it holds 

(2) Te{n,p) ^ii{n\p), 

where H(uj/i) denotes the relative entropy (also called Kullback-Leibler distance) of u with 
respect to p, defined by 

HHf.) ^=/log(|[)<i‘'. 
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if V is absolutely continuous with respect to and H(u|/i) := oo otherwise. Note that 
we focus on the line, but that all the above definitions easily generalize to probability 
measures on a general metric space. As a special case, as proved by Talagrand in his 
seminal paper [36], Inequality (2) is satisfied by the standard Gaussian measure for the 
cost Q{x) = x^/2. By extension, we shall say that jjL satisfies the inequality T 2 {C) (often 
referred to as “Talagrand’s inequality” in the literature), if (2) holds for a cost function 
of the form 9{x) = jC, for some C > 0. We refer to the books or survey [25, 19, 37, 10] 
for a complete presentation of transport-entropy inequalities and of the concentration of 
measure phenomenon, as well as for bibliographic references in the field. 

In the next few lines we shall shortly discuss the consequences of transport-entropy 
inequalities in terms of concentration. For simplicity we may only consider Inequality 
T2(C). 

As discovered by Marton and Talagrand, when a probability /U satisfies T 2 (C), then for 
all positive integers n, and all functions / : —)■ M which are 1-Lipschitz with respect to 

the Euclidean norm on M"', it holds 

(3) f/'if > med(/) +t) ^ ^ ^Jc\og{2), 

where med(/) denotes the median of / under [jJ^. We refer to [25, 10] for a presentation 
of the numerous applications of this type of dimension-free concentration of measure in¬ 
equalities. Conversely, it was shown by the first named author in [17] that a probability /r 
satisfying the dimension-free Gaussian concentration (3) necessarily satisfies T 2 (C), thus 
giving to Inequality T 2 a special status among other functional inequalities appearing in 
the concentration of measure literature. The key argument explaining why Talagrand’s 
inequality implies the dimension-free concentration behavior (3) is the well-known tensori- 
sation property enjoyed in general by inequalities of the form T(0) (see e.g. [19]). The 
tensorisation property shows in particular that if /i satisfies T 2 (C'), then the n-fold product 
measure fi <Si ■ ■ ■ 0 fi also satisfies T 2 (C') (on M"") with the same constant C. 

More generally, given a measure on a product space (which is not necessarily a prod¬ 
uct measure), and assuming that each of its conditional one-dimensional marginals sat¬ 
isfies a transport-entropy inequality, several authors have obtained, using different non- 
independent tensorisation strategies, transport-entropy inequalities for the whole mea¬ 
sure under weak dependence assumptions (see for instance [13, 39, 31, 38]). Then, the 
transport-entropy inequality for the whole measure leads again to concentration properties 
using the same classical arguments as in the product case. Thus, in many situations one is 
reduced to verify the one-dimensional transport-entropy inequalities, and therefore it is of 
a real interest to characterize those probability measures /r on M that satisfy the inequality 
T 2 and more generally T(0) for a general cost function 0. 

In this direction, the first-named author obtained, in [18], necessary and sufficient con¬ 
ditions for T(0) to hold, when the cost function 6 : —)• is continuous, convex and 

quadratic near 0. In order to present such conditions, we need to introduce some nota¬ 
tions. Denote by E^(x) := /i(—oo,x], x G M, the cumulative distribution function of a 
probability measure /r and by F~^ its general inverse defined by 

F~^{u) := inf{x G M, F^{x) ^ tt} G M U {± 00 }, Vu G [0,1]. 

The conditions obtained in [18] are expressed in terms of the behavior of the modulus of 
continuity of the non-decreasing map := E“^oFV, where r is the symmetric exponential 
distribution on M, 

= dx, 

2 


T{dx) : 
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SO that 


Up,{x) 


F-^ {l - ifx^O 

F-^ if X ^ 0. 


By construction, is the unique left-continuous and non-decreasing map transporting r 
onto fi {i.e. J f o U^dr = / fd^ for all /). In the special case of the inequality T 2 , the 
characterization of [18] reads as follows: a probability measure /r satisfies T 2 (C') for some 
C if and only the following holds 


• for some constant 6 > 0 and all u ^ 0, it holds 


sup(t/^(x -h m) - U^{x)) ^ -Vl + u, 

xSK 0 


• for some constant c > 0 and all / of class C^, the Poincare inequality holds 
(4) Var^(/) ^ c y /'^ d/i . 

We refer to [18] for a precise quantitative relation between C, b and c. 


In the present paper, partly following [18], we focus on the study of a new weak 
transport-entropy inequality introduced in [22] that is related to a weak type of dimension- 
free concentration. More precisely, in dimension one, we consider the weak optimal trans¬ 
port cost of u with respect to // defined by 


= inf 

TT 



yp{x,dy) 


p{dx ), 


where the infimum runs over all couplings n^dxdy) = p{x, dy)p{dx) of p and u, and where 
p(x, •) denotes the disintegration kernel of vr with respect to its first marginal. The 
notation bar comes from the barycenter entering in its definition. Note that, contrary to 
the usual transport cost, Te is not symmetric. Also, in terms of random variables, one 
has the following interpretation Fe{i^\p) = inf E (0(jA — E(yjA)j)) whereas Fe{i',p) = 
infE(0(jA — yj)), where in both cases the infimum runs over all random variables X,Y 
such that X follows the law p and Y the law u. As a consequence, when 9 is convex, by 
Jensen’s inequality, one has Te{v\p) ^ Te{v-,p)- Therefore, if a measure p satisfies T(d) 
then it also satisfies the following weaker transport-entropy inequalities. 


Definition 1.1. Let 6 : — )■ be a convex cost function. A probability measure p on 

M is said to satisfy the transport-entropy inequality 

T~'~(d): if for all v £ 'Pi(M), it holds 

Te{i^\p) ^ H(z^l/r); 

T (9): if for all u e 'Pi(M), it holds 

Te{pW) ^ H(z^l/r); 

T(d): if p satisfies both T'''(d) and T {9). 


In Section 4, we recall a dual formulation of these weak transport inequalities in terms 
of infimum convolution operators. In particular, the inequality T(d) appears as the dual 
formulation of the so-called convex (r)-property introduced by Maurey [32] (see also [34] 
for further development). 

The above defined weak transport-entropy inequalities are of particular interest since 
the class of measures satisfying such inequalities also includes discrete measures on M 
such as Bernoulli, binomial and Poisson measures [22, 34]. In comparison, the classical 
Talagrand’s transport inequality (and more generally any T(0)) is never satisfied by a 
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discrete probability measure unless it is a Dirac^. Moreover these weak transport-entropy 
inequalities also enjoy the tensorisation property (see [22, Theorem 4.11]), thus connecting 
them to a special dimension-free concentration behavior. For instance, as shown in [22, 
Corollary 5.11], a probability measure ^ satisfies T 2 (C') {i.e. T(0) with 9{x) = x'^jC, 
X £ M) if and only if, for all positive integers n and all convex or concave functions 
/: M” —>■ M which are also 1-Lipschitz for the Euclidean norm on M”, it holds 

/i"-(/ > med(/) +t) ^ ^ 

where to, C > Q are constants related to C (see [ 22 ] for a precise and more general 
statement). 

With these motivations in mind we shall prove the following characterization (the main 
result of the paper) of the transport inequalities T(0) associated to any convex cost function 
9 which is quadratic near 0. In the sequel we may use the following standard notation 
9{a ■) for the function x e-)- 9{ax). 


Theorem 1.2. Let fi £ 'Pi(M), to > 0 and 9 : M"*" — )• M'*' be a convex cost function such 
that 9{t) = t^ for all t ^ to- The following propositions are equivalent: 

(i) There exists a > 0 such that fv satisfies T{9{a ■)). 

(n) There exists b > 0 such that for all u > 0, 

sup {U^{x + u) - U^{x)) ^ -h tl). 

X 0 


Moreover, constants are related as follows: 

(i) implies (ii) with b = oki 

(ii) implies {i), with a = 6 k 2 , 


where ki := 


86»-i(lo*g(3)+t2) '^2 


min(l,to) 
2106»-i(2+4g) • 


Remark 1.3. In comparison with the characterization of the inequalities T(0) given in [18], 
one sees that only the condition on the modulus of continuity of remains. Nevertheless, 
as we shall explain below, the Poincare Inequality has not completely disappeared from 
the picture. 

Also, denoting by the modulus of continuity of defined by 

A;i(/i) = sup {U^{x + u) — U^{x),x £ M, 0 ^ u ^ /i} , h ^ 0, 

Condition {ii) asserts that 

A^(/i) ^ ^9-^{h + tl). 

Therefore is bounded above near zero but does not necessarily go to zero, as h goes to 
zero. In fact, if the measure /r is discrete and not a Dirac measure, the support of p, is not 
connected and so there exist a < b with a and b in the support of p such that p{]a, b[) = 0 . 
In that case, we may easily check that for all /i > 0, 6 — a ^ A^{h). This shows that in a 
discrete setting liuih^Q A^{h) > 0 . 


The proof of Theorem 1.2, given in Section 6 , is based on a refined study of the weak 
transport cost Te(/ilu) of independent interest. Indeed, we shall prove that, in dimension 
1 , all the optimal weak transport costs 'T${p\i') are achieved by the same coupling indepen¬ 
dently of the convex cost function 9. This result is well-known for the classical transport 
cost Te- More precisely, it follows from the works by Hoeffding, Frechet and Dall’Aglio 
[12, 16, 24] (see also [11]) that Te{p,n) = f 9 {\x — ri,_^(x)j) v{dx) where T^,^^ := o F^. 
In particular, given any two convex costs 9i,92, it holds Tej^+e 2 {h: = 'Thi {p, u) +Te 2 {p, u). 
Our second main result reads as follows (recall that ui ^ U 2 means that / / dui ^ J f di '2 


^Indeed, as mentioned above, the Poincare inequality is a consequence of Talagrand’s transport inequal¬ 
ity that forces the support of fi to be connected. 
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for all convex functions /: M —)■ M (one says that vi is dominated by V 2 in the convex 
order)). 

Theorem 1.4. Let /x, u G 'Pi(M) ; there exists a probability measure 7 dominated by u in 
the convex order, 7 ^ u, such that for all convex cost functions 9 it holds 

Te{n\fi) = 7 ^( 7 , At)- 

In particular, for any two convex cost functions 9i, 62 , it holds 

(5) T 81+82 t") = Tei (/X, p) + r02 (/X, p). 

The notion of convex ordering, characterized by Strassen [35] in terms of martingales, 
will turn out to be crucial in the understanding of the weak transport costs Tg. In Section 
2 , we recall certain classical properties of the convex order and in particular its geometrical 
meaning (in discrete setting) given by Rado’s theorem [33] (see Theorem 2.9). From this 
geometrical interpretation, we shall obtain an intermediate outcome (Theorem 2.10) that 
might be interpreted as a discrete version of Theorem 1.4. Finally, the proof of Theorem 
1.4, given in Section 3, will follow by an approximation argument. 

With the result of Theorem 1.4 in hand, we can briefly introduce the main ideas of the 
proof of Theorem 1.2. Following [18], the weak transport-entropy inequality (x) will follow 
from (u) by decomposition of the weak optimal cost 9 into two parts. One part is related 
to the quadratic behavior of 9 on [0, to] and the second part is related to its behavior for 
t ^ to- one has 9 ^ 9i + 92 with 

9i{t) := t^l[ 07 „](i) + (2tto - to)l[io,+oo)(i), 

and 

92{t) := [9{t) - t'^]+ = {9{t) - _^^^(t), t G M. 

Therefore, by Theorem 1.4, 

T 8 {a-){^\h) ^ T(0i+02)(a.)(ul^) =Te^{a-){^\lJ) + Te 2 (a-){^\lJ) ^ 2H(ul/x), 
for a proper choice of the constant a, where the last inequality will follow by relating the 
condition appearing in [ii) to the two weak transport-entropy inequalities with cost 0 i(a •) 
and ^ 2(0 ■ )• More precisely, following [18, Theorem 2.2], we will show in Theorem 6.1 that 
{ii) characterizes the weak transport-entropy inequality T( 02 (R‘ )) while, as stated in the 
next Theorem (our last main result), the weaker condition sup^. {U^{x -|- 1) — Ufj_{x)) ^ h, 
for some h > 0, characterizes the weak transport-entropy inequality T(0i(a •)) which thus 
appears (thank to [5]) to be also equivalent to the Poincare inequality (4) restricted to 
convex functions. 

Theorem 1.5. Let p, be a probability measure on M. The following assertions are equiv¬ 
alent: 

(i) There exists /i > 0 such that 

sup[Ufj,{x -A 1) - U^{x)] ^ h. 

{ii) There exists C > 0 such that for all convex function / on M it holds 

Var^(/) [ r^dp. 

JR 

{in) There exist D,lo> 0 such that p satisfies T {a) and T~''(a). Namely, it holds 
Ta{hW) ^ H{n\p) and Ta{i^\h) ^ H{i/\p) Vn G Vi{R), 
for the cost function a defined by 


6 


N. GOZLAN, C. ROBERTO, P.-M. SAMSON, Y. SHU, P. TETALI 


In particular, with k := 5480 and c := l/(10\/2), 

• (i) (in) with D = nh? and lo = c/h, 

• (Hi) ^ (ii) with C = 2D, 

• (i) (ii) with C = 

The equivalence (i) (ii) goes back to Bobkov and Gotze [5]. Hence, Theorem 1.5 
completes the picture by showing that (i) / (ii) also characterize the measures satisfying a 
weak transport-entropy inequality with a cost function which is quadratic near zero and 
then linear (like 9i). The dependence between the constants in the implication (ii) ^ (i) 
is not given for technical reasons. Indeed, the proof relies on an argument from [5] that 
uses a non trivial proof from [7] where one loses the explicit dependence on the constants. 

We indicate that during the preparation of this work, we learned that the character¬ 
ization of the convex Poincare inequality in terms of the convex (r)-Property (which is 
equivalent to the transport-entropy inequalities of Item (in), by duality, as we shall recall 
in Lemma 4.1) was obtained by Feldheim, Marsiglietti, Nayar and Wang in a recent paper 
[15]. 

The proof of Theorem 1.5 is given in Section 5. It uses results of independent interest like 
a new discrete logarithmic Sobolev inequality for the exponential measure r (Lemma 5.2). 
By transportation techniques, such a logarithmic-Sobolev inequality provides logarithmic- 
Sobolev inequalities restricted to the class of convex or concave functions for measures 
satisfying the condition in Item (i) (Corollary 5.1). Then the weak transport-entropy 
inequalities of Item (in) are obtained in their dual forms, involving infimum convolution 
operators (Lemma 4.1), by means of the Hamilton-Jacobi semi-group approach of Bobkov, 
Gentil and Ledoux [4], an approach also generalized in [26, 21, 22]. 

The paper is organized as follows. In the next section, we introduce and recall some 
known properties of the convex ordering that we shall use in Section 3 to prove Theorem 
1.4. Then, in Section 4, we very briefly recall the dual formulation of the weak-transport 
entropy inequalities T and T, borowed from [22], which will be useful later on. Finally, 
Section 5 and 6 are devoted to the proofs of Theorem 1.5 and 1.2, respectively. 

Acknowledgment: The authors would like to warmly thank Greg Blekherman for 
discussions on the geometric aspects in Section 2.3, including his help with the proof of 
Theorem 2.10. 


2. Convex ordering and a majorization theorem 

This section is devoted to the study of the convex ordering. After recalling some classical 
definitions and results, we shall prove a majorization theorem which will be a key ingredient 
in the proof of Theorem 1.4. 

2.1. A reminder on convex ordering and the Strassen Theorem. We collect here 
some basic facts about convex ordering of probability measures. We refer the interested 
reader to [27] and [23] for further results and bibliographic references. All the proofs are 
well-known, we state some of them for completeness. 

We start with the definition of the convex order. 

Definition 2.1 (Gonvex order). Given ui, U 2 G 'Pi(M), we say that U 2 dominates in the 
convex order, and write ui ^ U 2 , if for all convex functions / on M, f dui ^ J^f dv 2 . 

Remark 2.2. Observe that for any probability measure belonging to 'Pi(M) the integral of 
any convex function always makes sense in M U {-|-oo}. 

The convex ordering of probability measures can be determined by testing only some 
restricted classes of convex functions as the following proposition indicates. 

Proposition 2.3. Let vi,V 2 G 'Pi(IR) ; the following are equivalent 
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(i) Vi< V2, 

(a) J xvi{dx) = / xi' 2 {dx) and for all Lipschitz and non-decreasing and non-negative 
convex function / : M —)■ M"'', / /(x) vi{dx) ^ / f{x) i' 2 {dx). 

(Hi) f xvi{dx) = / xi' 2 {dx) and for all i G M, f[x — i]+ i'i{dx) ^ /[x — i]+ i> 2 {dx). 

For the reader’s convenience and for the sake of completeness, we sketch the proof of 
this classical result. We refer to [27] for more details. 

Sketch of the proof. Let us show that {i) is equivalent to {ii). First, since the functions 
X !-)■ X and x e-)- —x are both convex, it is clear that vi ■< V 2 implies / x vi{dx) = f x 1 ^ 2 (dx) 
so that (i) implies (ii). Conversely, since the graph of a convex function always lies above 
its tangent, subtracting an affine function if necessary, one can restrict to non-negative 
convex functions. Moreover, if /: M —)■ is a convex function, then /„: M —)■ M'*' defined 

by /n = / on [-n,n], fn(x) = fn(n) + fn(n)(x - n) \i x n and fn(x) = fn(-n) + 
f((—n)(x -|- n) if X ^ —n (where f( denotes the right derivative of /) is Lipschitz and 
converges monotonically to / as re goes to infinity. The monotone convergence Theorem 
then shows that one can further restrict to Lipschitz convex functions. Finally, up to the 
subtraction of an affine map, any Lipschitz convex function is non-decreasing, proving that 
(ii) implies (i). 

Now it is not difficult to check that any convex, non-decreasing Lipschitz function 
/: M —)• M"*" can be approached by a non-increasing sequence of functions of the form uq -\- 
Y)a=i oti[x — ti]j^, with a* ^ 0 and L G M. This shows that (ii) and (in) are equivalent. □ 

The next classical result, due to Strassen [.35], characterizes the convex ordering in terms 
of martingales. 

Theorem 2.4 (Strassen). Let i'i,i '2 G 'Pi(M) ; the following are equivalent: 

(i) Pi ^ P2, 

(ii) there exists a martingale (X,Y) such that X has law rei and Y has law ^ 2 . 

We refer to [22] for a (two-line) proof of Theorem 2.4 involving Kantorovich duality for 
transport costs of the form T. 

2.2. Majorization of vectors and the Rado Theorem. The convex ordering is closely 
related to the notion of majorization of vectors that we recall in the following definition. 
As for the previous subsection, all the proofs are well-known and we state them for com¬ 
pleteness. 

Definition 2.5 (Majorization of vectors). Let a, 6 G M"' ; one says that a is majorized by 
6 , if the sum of the largest j components of a is less than or equal to the corresponding 
sum of b, for every j, and if the total sum of the components of both vectors are equal. 

Assuming that the components of a = (ai,...,o,i) and b = (bi,... ,bn) are in non¬ 
decreasing order (i.e. ai ^ 02 ^ ... ^ a„ and 61 ^ 62 ^ ^ bn), a is majorized by b, 

if 

On T Oj^—i -|- • • • -|- an—j-\-l ^ bn T bn—I “1“ ‘ ‘ ‘ “1“ bn—j-\-l, for j — 1,..., re 1, 
andE^=l«^ = E^=l bi. 

The next proposition recalls the link between majorization of vectors and convex order¬ 
ing. 

Proposition 2.6. Let a,b G M” and set vi = ^ and ^2 = 

following are equivalent 

(i) a is majorized by b, 

(ii) vi is dominated by V 2 for the convex order. In other words, for every convex 
/ : M —)■ M, it holds that Y)'i=i f(ai) < Y)a=i f(bi) ■ 
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Thanks to the above proposition and with a slight abuse of notation, in the sequel we 
will also write a when a is majorized by b. 

Proof. Assume without loss of generality that the components of a and b are sorted in 
increasing order. We observe first that, by construction, the equality / xvi{dx) = / xv 2 {x) 
is equivalent to ai = X]r=i 

We will first prove that (i) implies {ii). By Item (in) of Proposition (2.3) we only need 
to prove that a <b implies 


( 6 ) 


Vt G 


J2[ak-t]+ ^ -i] + , 

k=l k=l 

Assume that t ^ maxa^ (otherwise (6) obviously holds). Then, let ko be the smallest k 
such that Ok ^ t so that “ ^]+ = J2k=koi^k — t). Therefore, by the majorization 

assumption (which guarantees that J2k=ko ^ J2k=ko ^k), we get 

n n n n 

- t]+ = ^ (Ofe - t) ^ ^ bk-t^^[bk- t] + . 

k=l k=ko k=ko k=l 

Conversely, let us prove that (ii) implies (i). Fix A; G {1, ... ,n} and set fk{x) := [x — bk] + , 
X G M. Plugging fk into Item (ii) of Proposition (2.3) leads to 

n n « p n n 

'^ai-bk^ ^[a* - bk]+ = n f(x)ui(dx) ^ n / f(x)v 2 (dx) = Y^[bi - bk]+ = ^bi- bk, 


i=k 


i=l 


2 = 1 


i=k 


□ 


so that J22=k ^ J27=k which proves that a is majorized by b. 

Next we recall a simple classical consequence of Proposition 2.6 in terms of discrete 
optimal transport on the line. 

Proposition 2.7. Let x,y € M"" be two veetors whose eoordinates are listed in non¬ 
deereasing order (i.e. xi ^ X 2 ^ ^ Xn, yi ^ y 2 ^ ^ yn)- Then for all permutation 

a of {1 ,... ,n} and all eonvex 0 : M —)• M, it holds 

n n 

^ e(xi - yi) ^ ^ e(xi - y^i^i)). 


2=1 


2=1 


Proof. Since, for all k, J2i=k Vi ^ T>i=k 2/a(i), it holds for J2i=k{xi - Vi) ^ J2i=k{xi - y^ii)) 
(with equality for k = 1). Therefore, denoting ya- = (^^(i), ■ ■ • ,ya{n))i it holds x — y < 
X — ya- Applying Proposition 2.6 completes the proof. □ 

Remark 2.8. In particular, let fj,, v are two discrete probability measures on M of the form 

n n 


n 


n 


i=l i=l 

where the xfs and the yfs are in increasing order, and assume for simplicity that the xfs 
are distinct. Then the map T sending x* on y* for all i realizes the optimal transport of y 
onto V for every cost function 0 . 

We end this section with a characterization of the convex ordering (or equivalently 
of the majorization of vectors, thanks to Proposition 2.6), due to Rado [33]. We may 
give a proof based on Strassen’s Theorem. For simplicity, we denote by Sn the set of 
all permutations of {1,2,..., n} and, given a G Sn and x = (xi,..., Xn) G M”, we set 

X(j . (Xcr(l)j • ■ ■ ) ®(T(n))' 

Theorem 2.9 (Rado). Let a,b G M” ; the following are equivalent 

(i) the veetor a is majorized by b; 

(ii) there exists a doubly stoehastie matrix P sueh that a = bP; 
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(in) there exists a eolleetion of non-negative numbers with = 1 such 

that a = J 2 fT£S„ Acr^o- (in other words a lies in the convex hull of the permutations 
ofb). 


Proof. First we will prove that (i) implies (m). According to Proposition 2.6, a ^ 6 is 
equivalent to saying that ^ ^ai is dominated by U 2 = ^ Y)Ji=i i^ convex 

order. Set X := {ai,...,a„}, y := {bi,...,bn}, kx := #{i G {l,...,n} : Oj = x}, 
X G X and iy = ff{i G {1,... ,n} : bi = y}, y G 3^ (where ff denotes the cardinality); 

observe that ^ Yl,x^x kx^x and ^2 = ^ According to the Strassen Theorem 

(Theorem 2.4), there exists a couple of random variables {X,Y) on some probability 
space (n,A, P) such that X is distributed according to ui, and Y according to 1^2 and 
X = E[y|A]. Since A is a discrete random variable, 

E[y|A] = V a.s. 

L J ^^F{X = x) ^ 


Therefore, for all x G X, 


nYix=x] 
F{X = x) 


^yyk^y,x, 

y&y 


where Ky^x '■= Hence a = bP with Pj^i := Kh.^ai, i, j = ■ ,n. This proves 

Item (ii), since P is doubly stochastic by construction. 

If a = bP with a doubly stochastic matrix P, then it is easily checked that /(^i) ^ 
X)r=i fi^i) for any convex function / on M so that (ii) implies (i). 

Finally, according to Birkhoff’s theorem, the extremes points of the set of doubly sto¬ 
chastic matrices are permutation matrices. Therefore every doubly stochastic matrix can 
be written as a convex combination of permutation matrices showing that (m) and {in) 
are equivalent. □ 


2.3. Geometric aspects of convex ordering and a majorization theorem. Con¬ 
trary to the previous subsections, the results presented here are new. Fix some vector 
b = ( 61 , 62 , • • •, bn) of M"' with distinct components (for simplicity). We will be working 
with the convex hull of the permutations of b, a polytope we denote by Perm( 6 ) and 
defined as 

Perm( 6 ) := < ^ Ao-&o-,with Ao- ^ 0 and ^ Ao- = 1 
[_o'e5„ o-eSn 

Such a polytope is often refered to as the Permutahedron generated by b. According to 
Rado’s Theorem 2.9, Perm( 6 ) = {a G M” : a Y b}. Hence, Perm( 6 ) is a subset of the 
following affine hyperplane 

{ n n 

X GlSP :'^Xi = '^bi> =b + £o, 

i=l i=l ) 

with To := {x G M"" : iCi = 0}. 

We will be interested in the faces, facets containing a given face, and normal vectors to 
such facets of Perm( 6 ). We need to introduce some notations. 

Denote by [n] the set of integers from 1 to re. For S C [re], let vs{b) denote the vector 
with the IT] largest components of b in the positions indexed by S (in decreasing order, 
say), and the remaining re — |5| lowest components of b in the other positions indexed 
by [re] \ S (also in a decreasing order). Also, when 5 / 0, we denote by Ps{b) the set 
that contains the vector vs{b) along with all vectors obtained by permuting any subset of 
coordinates of vs{b), as long as the subset is contained in S or in [re] \ S. (That is, the 
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only permutations that are not allowed are those that involve elements from both S and 
[n] \ S'). More precisely 

Ps{b) '■= {{vs{b))a,o' £ Sn such that cr(S) = S} 

where (t(S) := {(t(z), i £ S} denotes the image of S by a. 

More generally, given a partition S = (Si, S 2 ,..., Sk) of [n], let vs{b) denote the vector 
with the largest |Si| coordinates of b in the positions indexed by Si (in decreasing order), 
then the next largest IS 2 I coordinates in the positions indexed by S 2 and so on (as an 
illustration, for b = (1,4, 5,—2,3, 9, 6 ,—5) £ M® and S = (Si,S 2 ,S 3 ) with Si = { 1 , 2 }, 
S 2 = {3,6,7} and S 3 = {4,5,8}, we get vs{b) = (5, ii, 5, 1, — 2,4, 3, —5) where the italic 
positions refer to the set Si, the bold positions to the set S 2 and the remaining positions 
to S 3 ). Also, we denote by Ps{b) the set containing the vector vs{b) along with all vectors 
obtained by permuting the coordinates of vs{b) that belong to the same S*: 

Ps{b) := {(^^(ft))^, c £ Sn such that for all i,a{Si) = S*}. 

Now we recall two geometric definitions/facts from [3]. 

Fact 1: A facet of Perm( 6 ) is the convex hull of Psib), for some S 0, [n]. 

Fact 2 : A /ace of Perm(6) is the convex hull of P5 (6), for some partition S = (Si, S 2 ,..., Sfc) 
of [n] with k ^ 3. Furthermore, given a face F = Conv{Ps{b)), there exist ex¬ 
actly k — 1 facets containing F that are obtained by coalescing the first and last 
several S/s in S: that is, for each 1 < j < k — 1, the facet Fj containing F can 
be described by taking the partition [n] = T 1 UT 2 with Ti = Si U • • • U Sj, and 

r2 = s,-+i u • • • u Sfc. 

The next theorem, which we may call the Majorization Theorem, is a key ingredient in 
the proof of Theorem 1.4. It provides a geometric interpretation of majorization in terms 
of projection. 

Theorem 2.10 (Majorization Theorem). Let a,b € M"', assume that b has distinct coor¬ 
dinates and that a ^ Perm(5). Then the following are equivalent: 

(i) c G Perm( 6 ) satisfies 

a — c < a — c, Vc£ Perm( 6 ); 

(a) c is the closest point o/Perm( 6 ) to a; that is, 

c := arg min (||a —clU). 

cSPerm(b) 

Moreover the vector c is sorted as a : (oj ^ aj) ^ (q ^ Cj) , for all i,j. 

Let us recall that the orthogonal projection of a point a on the polytope Perm( 6 ) is the 
unique c £ Perm( 6 ) such that 

(7) (a —c, c —c)^0, Vc£Perm( 6 ). 

Proof. Observe that if 7 ^ bi, then letting d := a — ^( 1 , !,...,!) with k := 

Z)r=i ~J27=i bi: we see (using (7)) that the orthogonal projection of o and d on Perm( 6 ) 
are equal (to some point we denote by c, say), and that a — c ^ a — cif and only 
if a — c ^ d — c. Therefore we can assume without loss that a and b are such that 

E?=ia. = Er=i&*- 

We will first prove that (i) implies (m) which is the easy part of the proof. 

{i) (ii). Let c be the closest point of Perm( 6 ) to o (7e. c := argmin(,gPerm(fe)(ll®“c|| 2 )). 

Then by (i), a — c ^ a — c, which, by Proposition 2.6 (applied to f{x) = x^) implies that 
“ c)? < “ ^)i - definition of c, this is possible only if c = c. 

Next we will prove that (ii) implies (z). For the sake of clarity, we first deal with the 
simple case when c lies on a facet of Perm( 6 ), before dealing with the general case of c 
being on a face. 
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(m) (i). Let c be the closest point of Perm(5) to a. Since Perm( 6 ) is invariant by 

permutation, it easily follows from Proposition 2.7 that the coordinates of c are in the 
same order as the coordinates of a. Hence, we are left with the proof that a — c ^ a — c 
for all c G Perm ( 6 ). 

(a) A simple case: c € F for some facet F. Since c is chosen from Perm(6), and since 
we assumed that J2i bi = “u have “ c)i = 0. Writing a := a — c G So, suppose 
that a is perpendicular to the affine subspace H := T-Lp containing a facet F, defined by 
some nonempty subset S of [n]. For all x,y £ F, we thus have {a,x — y) = 0. Choosing 
X = vs{b) and y = x^^ obtained by permuting two coordinates of x whose indices are both 
in S or both in (7e. Tij = (ij) is the transposition that permutes i and j, with i,j G S, 
or i,j G 5'^), one sees that the coordinates of a are constant on S and S'^. We denote by 
as and as<^ the values of a on these sets, which verify kas + (n — k)asc = 0 since a G Sq- 

Now (recalling that a = a — c) our task is to show that 

a ^ a — (c — c), for every c' G Perm( 6 ) . 

This amounts to showing that 

a ^ a — c, for every c such that (a, c) ^ 0 , and ^ c* = 0 . 

i 

Indeed, on the one hand the choice of c implies, by (7), that for every S G Perm( 6 ) {a, c' — 
c) < 0, and on the other hand, since c, c' G Perm( 6 ), necessarily Ci = 0. 

Now (a, c) ^ 0 and Q = 0 together imply (recall that a is constant on S and S^) 
that 

{as -asc)^Ci ^ 0 . 
i&S 

Let us assume that as > as^- Then denoting by cs = J2i£S 

has C 5 ^ 0 and cgc ^ 0. Therefore, for any convex function / on M, according to Jensen’s 
inequality and by convexity, we get 

/(„, - e,) = -f j) 

k n — k 

> kf {as - I) + (n - k)f {as. - 

^ kf{as) + (n - k)f{asa) - f'{as)cs - f{asa)csa 

n 

^ '^f{o‘i), 

i=l 

where the last inequality comes from the fact that f'{as)cs + f'{ag)csa = cs{f'{as) — 
f'{asa)) ^ 0. According to Proposition 2.6, we conclude that a ^ a — c which is the 
expected result. 

(b) The general case. Suppose that c lies in a face F of the polytope. This face is 
related to a partition S = (5i,..., Sk) of [n], with k ^ 3. Then a := a — c £ N{F), where 
N{F) denotes the normal cone of F. Recall that the extreme rays of N{F) are given by 
the facet directions for the facets containing F. For alH G {1,..., n — 1}, let us denote by 
Fi the facet containing F associated to the partition % = {S'! U ... U S*; Si+i U ... U Sk}, 
1 ^ i ^ k — 1. Consider the vectors pi,p 2 , ■ ■ ■ ,Pk-i £ Sq defined by 

ki 

Pi = l5iU52U-USi - 

where It denotes the 0 — 1 indicator vector of T, for T C [n], and A:* = |5i| + • • • + |5j|. 
For each i, the vector pi is orthogonal to the facet Fj. Moreover, for all c G Perm( 6 ) one 
may check that {c,pi) ^ {vpi,Pi), with equality on Fi. This shows that pi is an outward 
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normal vector to Fi. Therefore N(F) is the conical hull of the pj’s, and so we may express 
a, for a suitable choice of A* > 0, as: 

« = X! '^*lSiUS2U-USi - O-lfn] ) 
i 

where cr = (l/n)Ef=i Aj|52| + - • • + Afc-i|5fc_i|] . In particular, a is constant 

on each Sj : for all i ^ Sj, ai = (^pZj — a := Aj. 

In order to establish (1), we need to show that 

a < a — {c — c), Vc e Perm(6), 

or in other words, we need to show that 

a < a — c , yd G Perm(6) — c. 

We now use again the fact that our choice of c implies that, for all 1 < i < A; — 1, 

{Pi, c) > (pi, c), Vc G Perm(6) . 

This in turn gives the following: 

Perm(6) — c C {c' : {d,pi) < 0, Vi} . 

Thus using N{F)^ := {d G To! {d,Pi) < 0, Vi} to denote the polar cone, it then suffices 
to show that for a (as above), 

a<a-d, yd^N{Ff. 

Now, d G N{F)^ implies that 

{d, ^SiUSiU-uSj) <0 and = 0 , 

i 

therefore denoting Ej = J2i£SiU...us- dii for all j G {0,1,..., A:}, one has Ej ^ 0 and 
Eo = Ek = 0. 

Let / : M —)• M be a convex function ; denoting by f its right derivative, the convexity 
of / implies that 

n k k k 

Y, fia, - dO = E E -d^)>Y\Sj\fiA,) - E 

i=l j=li£Sj j=l j=l 

where Dj = di. Now, using an Abel transform (and the fact that Eq = E^ = 0), 

one gets 

E f{A,)D, = E fiA,){E, - E,.,) = YifiAj) - f{A,+,)Ej ^ 0, 

j=i j=i j=i 

where the inequality comes from Ej ^ 0, Aj ^ and the monotonicity of f. Therefore, 
one gets 

n k n 

E/(«i -di)^Y\Sj\fiAj) = E/(«o, 

2 = 1 j = l 2=1 

which proves that a < a — d, thanks to Proposition 2.6, as expected. This completes the 
proof. □ 


CHARACTERIZATION OF WEAK TRANSPORT-ENTROPY INEQUALITIES 


13 


3 . Properties of the optimal coupling for weak transport costs 

This section is devoted to the proof of Theorem 1.4. We will establish first a preliminary 
result which gives some connection between T and T. In the sequel, we denote by Im(/i), 
respectively Im^(/i), the set of probability measures on M which are images of /r under 
some map S : M —)• M, respectively some non-decreasing map S', i.e., 

Im(/i) = {7 G P(M) : 3S : M —)■ M measurable such that 7 = 

and 

Im^(/r) = {76 'P(M) : 3S : M —>■ M measurable, non-decreasing, such that 7 = S^/i}. 
Proposition 3.1. For all probability measures p,, v on M, it holds 
inf ^ Te{y\F) ^ inf 

77i/,7eImT(/i) 7^i^>7eIm(/.4) 


Remark 3.2. Note that when p has no atoms, then Imi^(/r) = Im(/r). If /x is a discrete 
probability measure, then the two sets may be different. For instance, if /x = gdo + gdi, 
then 7 = 1^0 + is in Im(|u) but not in Imi^(/x). In the proof of Theorem 1.4 below, 
we will use Proposition 3.1 with [a being the uniform distribution on n distinct points for 
which it is clear that Imi^(/x) = Im(p). 


Proof. First we will prove that Te{i^\lJ-) ^ inf-y-<jy,-),gim(xi) To that aim, denote by 

TT{dxdy) = p{x, dy)p.{dx) some coupling between p, and v and set S{x) := / yp{x,dy), 
X G M. Clearly S^p G Im(/x). Moreover if / : M —?■ M is some convex function, by Jensen’s 
inequality, it holds 

J f{x)S#p{dx) = J f (^J yp{x,dy)^ p{dx) ^ JJ f {y) p{x, dy)p{dx) = j f{y)v{dy) 
so that S^p ■< V. Therefore, 

yp{x,dy)] p{dx)= j e{x-S{x))p{dx)f:Te{S#p,p) ^ inf Te{'^,p) 

/ J 7d!^,7Slm(xt) 

from which the claim follows by taking the infimum over p. 

Now we turn to the proof of the inequality Te{n\p) ^ ™f 7 -<j/ 7 gim^(;i) Assume 

that 7 ^ p and that 7 = S#p for some non-decreasing map S. According to Strassen’s 

theorem, there exists a coupling vri with first marginal 7 and second marginal u such that 

TTi{dxdy) = pi{x, dy)j{dx) and x = J^ypi{x, dy), 7 almost everywhere. For all x G M, 

define the following probability measure p{x,dy) := pi{S{x), dy). Then for all bounded 

continuous function /, it holds 

JJ f{y)p{x,dy) p{dx) = JJ f{y)pi{S{x),dy)p{dx) 

= II f{y)Pi{x,dy)-f{dx) = I f{y)n{dy). 

Thus the coupling 7r(dxdy) = p{x, dy)p{dx) has p as first marginal and u as second mar¬ 
ginal. Moreover, by definition of pi and p, p almost everywhere, it holds 

I yp{x, dy) = I ypi{S{x),dy) = S{x). 

Since S is non-decreasing, it realizes the optimal transport between p and u for the classical 
transport cost Te and so it follows that 

Te{l,p) = I ^( 1 ^ “ yp{x,dy)\)p{dx) Te{i^\p) 



which achieves the proof by taking the infimum over 7 . 


□ 


14 


N. GOZLAN, C. ROBERTO, P.-M. SAMSON, Y. SHU, P. TETALI 


We are now in a position to prove Theorem 1.4. 


Proof of Theorem l.f. The proof of the first part of Theorem 1.4 is divided into two steps. 
In the first step we will deal with uniform discrete measures on n points, while in the second 
step we will use an approximation argument in order to reach any measures. 

Step 1. We first deal with 

2 n n 

and u ■= 

n “ n 

1=1 1=1 

with ai < 02 <■■■< a-n and 5 i < 62 < ... < Set a := (oi,..., a„) and b := (61,..., bn)- 
According to Theorem 2.10, there exists some c G Perm(6) such that a — c < a — c, for 
all c G Perm( 5 ). Moreover the coordinates of c satisfy Cj ^ h+i- Set 7 ^ Y^l=i ^ci and 

observe that v dominates 7 for the convex order and 7 G Im^(/i). (Recall the definition 
from the beginning of this section.) 

Now for any 7 := ^ Yl'i=i £ Im^(/i) with Cj ^ Cj+i and for any convex cost function 
0, it holds (since the coordinates are non-decreasing) 


Te{l,h) = 


1 ” 

- Ci\). 


In particular 

( 8 ) 


70(7,/i) =-^6»(|oj - Cil) ^ inf 
n -f— cePerir 


1 


i=l 


cGPerm(b) Tl 


CLi Ci 


A probability 7 such that 7 ^ u, 7 G Im^(/r) is of the form 7 = ^ Yl^=i ^ci with Cj ^ Cj+i 
and c = (ci,..., Cn) G Perm( 6 ), and for such a c, it holds ^ Yll=i ^(1®* “ qI) = 7 ^( 7 , ^). 
Therefore, the latter implies 

Tb{ix) ^ inf Te{'y,^^) = Te{v\h) 

where the last equality follows from Proposition 3.1 and the fact that for such a distribution 
fi, it holds Im(/r) = Imi^(/r) (see Remark 3.2). Since obviously Teii^ln) ^ 7 ^( 7 ,/i), we 
conclude that = T 0 (p|//) as expected. 

Step 2. In the second step we deal with the general case using an approximation 
argument. 

Let fi and be two elements of 'Pi(]R). By assumption, / |x| fj,{dx) < 00 and / \x\ u{dx) < 
00 , hence, according to the de la Vallee-Poussin Theorem (see e.g. [9, Theorem 4.5.9]), 
there exists an increasing convex function (5 : —?■ M"*" such that /3(t)/t —)■ 00 as t ^ 00 

and such that / /3(|x|) fi{dx) < 00 and / /3(|x|) u{dx) < 00 . 

Next we will construct discrete approximations of p and v. According to Varadara- 
jan’s theorem (see e.g. [14, Theorem 11.4.11]), if Xi is an i.i.d sequence of law fi, 
then, with probability 1 , the empirical measure := ^J2?=i^Xi converges weakly to 
fi. On the other hand, according to the strong law of large numbers, with probability 
1 , I \^\ h{dx) as n —)• 00 . Let us take (xi)j^i, a positive realization of 

these events and set gin = ^ Yll=i ^ (") j where ^ x^"^^ ^ ... ^ xi”"^ denotes the in- 

creasing re-ordering of the vector (xi, X 2 , ■.., x„). Then the sequence fin converges weakly 
to fi and J\x\ fin{dx) —)■ J\x\fi{dx). According to [37, Theorem 6.9], this is equivalent 
to the convergence of the ITi distance : Wi{fin, fi) —)■ 0 as n —>■ 00 . Note that one 
can assume that the points x) ' are distinct. Indeed, if this is not the case, then let¬ 
ting x\^'^ = xl^'^ P ijr? one obtains distinct points and it is not difficult to check that 
An = 7 Z]r=i still weakly converges to fi (for instance the Wi distance between fin 

and An is easily bounded from above by (n -|- l)/(2n^)). The same argument yields a 
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sequence Un = ^ Z)r=i ^y(^) with converging to v in the Wi sense. It is not 

difficult to check (invoking the strong law of large numbers again) that one can further 
impose that / /3(|x|) Vn{dx) —)■ / /3(|x|) v{dx), as n —)• oo. 

For all re ^ 1, one applies the result proved in the first step : there exists a unique 
probability measure 7 ^ ^ such that 

'To{l^n\l^n) — IJ^n)) 


for all convex cost functions 9. Let us show that one can extract from 7 ^ a subsequence 
converging to some 7 in 'Pi(M) for the Wi distance. By construction / /S(|x|) re„(dx) —)■ 
/ I3{\x\) v{dx) and so M = sup„^]^ / j3{\x\) Un{dx) is finite. Since % ^ and since the 
function x 1 —)■ /?(|x|) is convex, it thus holds / /3(|x|) 7 „(dx) ^ / j3{\x\) Un{dx) ^ M. In 
particular, setting c{R) = /3{t)/t, R> 0, Markov’s inequality easily implies that 




\x\ 7 „(dx) ^ 


//3(|x|) Vnjdx) 

c{R) 


M 

Wr 


Consider 7 ^ defined by ^^{x) 


i+kl 

J l-l-hl ■j„{dx) 


. Then it holds. 


SUP7n([-^,-R]'') ^ 
n71 


2M 

c(i2)’ 


Vi? ^ 1 , 


and so the sequence jn is tight. Therefore, according to the Prokhorov Theorem, extracting 
a subsequence if necessary, one can assume that 7 ^, converges to some 7 for the weak 
topology. Extracting yet another subsequence if necessary, one can also assume that 
/(I + \x\)'yn{dx) converges to some number Z > 0. The weak convergence of 7 ^ to 7 
means that / ip d^n f ‘pdj for all bounded continuous (p, which means that 

y(l + |a:|)¥^(x) %(dx) ^ J{I + |x|)y?(x) ^(dx), 

where j{dx) = ^{dx) E 'Pi(M). Invoking again [37, Theorem 6.9], this implies % —)• 7 

as re —)■ 00 for the IFi distance. 

Now we will check that 7 is such that 70 (re|/i) = Tg{'y,fJ.) for all convex cost functions 
9 : M'*' —>■ M'*'. First assume that 9 is Lipschitz, and denote by Lg its Lipschitz constant. 
According to [22, Theorem 2.11], the following Kantorovich duality formula holds 


TgivnlUn) = sup Qgip{x) Vn{dx) - J ip{y) Pnidy) j , 

where the supremum is taken over the set of convex functions ip bounded from below, with 
Qgip{x) := infy6]R{v7(y) + 9{\x - y\)}, x G M. Define ip{y) := sup3.g]R{Q0V9(x) - 9{\x - y])}. 
Then it is easily checked that ip ^ ip, ip is bounded from below and Qg<p = Qgp. Moreover, 
being a supremum of convex and Le-Lipschitz functions, the function p is also convex 
and Lg-Lipschitz. Therefore, the supremum in the duality formula above can be further 
restricted to the class of convex functions which are L^-Lipschitz and bounded from below. 
Using the fact that Wi{iyn,^) = sup{/ f dvn — J f dv} where the supremum runs over 
1-Lipschitz function and the fact that Qgp is L^-Lipschitz (being an infimum of such 
functions), we easily get the following inequality 

\Tg{Un\Pn) - Tg{l^\p)\ ^ LgWl^l^n,!^) + LgWl{yn,p)- 

A similar (but simpler reasoning) based on the usual Kantorovich duality for Tg yields the 
inequality 

\Tg{%,yn) - Tg{j,y)\ ^ LgWi{%,^) + LgWi{yn,p)- 
Passing to the limit as re —>■ 00 in the identity Tg{vn\Pn) = Tg{^n, Pn), we end up with 
Tg{iy\p) = Tg{j,fJ,). 







16 


N. GOZLAN, C. ROBERTO, P.-M. SAMSON, Y. SHU, P. TETALI 


Now it remains to extend this identity to general convex functions 0 not necessarily 
Lipschitz. Let 9 : M"*“ —)• be a convex cost function (such that 0(0) = 0) and for 

all n ^ 1 , let 6n be the convex cost function defined by 9n{x) = 0 (x), if x G [ 0 ,n] and 
9n{x) = 6{n) + 9'{n){x — n), if x ^ n, where 0' denotes the right derivative of 0. It is 
easily seen that 0^ is Lipschitz and that converges to QdP> monotonically as n —)• oo, 

for any function tp bounded from below. Therefore, the monotone convergence theorem 
implies that for any probability measure 7 , it holds / Qoipd'y = sup^^;^ / Qo^ipd'^. We 
deduce from this that Te{v\p) = sup^^^ T 6 i„(u|/i) and Te{^\p) = sup„^]^ 70 „( 7 , ^). Since 
Te„(p|/i) = 7 e„( 7 ,/i) for all n ^ 1, this ends the proof of the first part of the theorem i.e. 
that Te(u|/i) = Te[i\p)). 

From the first part of the theorem we conclude that there exists some 7 G 'Pi(M) such 
that Te{'i^\p) = for the three cost functions 0 = 0i,02,0i + 02- The result then 

follows from the well-known additivity of 7 e in dimension one: 70^+02(75 A*) = + 

7^2(7) A*) • This ends the proof of the theorem. □ 


4. Dual formulation for weak transport-entropy inequalities. 


In this short section we recall the Bobkov and Gotze dual formulation of the transport- 
entropy inequality ( 2 ) and its extensions, borrowed from [ 22 ], related to the transport- 
entropy inequalities of Definition 1.1, in terms of infimum convolution inequalities. The 
results are stated in dimension one to fit our framework but hold in more general settings 
(see [22]). They will be used in the next sections. 


Lemma 4.1. Let fi G T’i(M) and 9 : M"*" —> 
functions 5 : M —)■ M bounded from below, set 


be a convex cost function and, for all 


Qtg{x) := inf f{y) + t9 

j/SK 


k - y\ 


t 


A > 0, X G 


Then the following holds. 

(i) fj, satisfies T(0) if and only if for all g : 


exp 


-)• M bounded from below it holds 

J QigdfiJ exp J gdp^ ^ 1 . 


(a) fi satisfies T (0) if and only if for all convex g : 
holds 


exp 


J Qigdy^ J exp{-g) dp ^ 1 . 


bounded from below it 


bounded from below it 


(Hi) jjL satisfies T (0) if and only if for all convex 5 : M — 
holds 

J exp{Qig) dgexp (^- J gdy^ ^ 1 . 

(iv) If y satisfies T(0), then for all convex o' : M —)■ M bounded from below it holds 


( 9 ) 


exp(Qtg') dy J exp{-g) dy ^ 1 , 


with t = 2. Conversely, if y satisfies (9) for some t > 0, then it satisfies T(A0( ■ /t)). 


Proof. The first item is due to Bobkov and Gotze [ 6 ] and is based on a combination of 
the well-known duality formulas for the relative entropy and for the transport cost Te. 
Items {ii) and {in) generalize the first item to the framework of weak transport-entropy 
inequalities. We refer to [22, Proposition 4.5] for a more general statement and for a proof 
(based on an extension of duality for weak transport costs). 

Finally we sketch the proof of Item {iv) (which already appeared in a slightly different 
form in [19, Propositions 8.2 and 8.3]). By the very definition, if y satisfies T(0) then it 
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satisfies T^(0) and therefore, it satisfies the exponential inequalities given in Items {ii) 
and {in). Note that if g is convex and bounded from below then Qig is also convex and 
bounded from below. Therefore it holds 


exp 


Qigdg 


j exp(- 5 r) d/i ^ 1 


and 

J exp{Qi{Qig)) dgexp J Qigdg^ ^ 1 . 

Multiplying these two inequalities and noticing that Qi{Qig) = Q 2 g (for a proof of this 
well-known semi-group property, see e.g [37, Theorem 22.46]) gives (9) with t = 2. The 
converse implication simply follows from Jensen’s inequality. □ 


5. A TRANSPORT FORM OF THE CONVEX POINCARE INEQUALITY 


This section is devoted to the proof of Theorem 1.5. Since, from [5], Item (i) is equiv¬ 
alent to Item {a), and since it is easy to prove that Item {in) implies Item {ii) we will 
mainly focus on the implication {i) ^ {in). Our strategy is to prove a modified loga¬ 
rithmic Sobolev inequality for the exponential probability measure r and then, using a 
transport argument, a modified logarithmic Sobolev inequality for general fj, (satisfying 
the assumption of Item {i)) restricted to convex or concave Lipschitz functions. Finally, 
following the well-known Hamilton-Jacobi interpolation technique of [4] , the desired trans¬ 
port inequalities will follow in their dual forms (recalled in Lemma 4.1). 

We need some notations. Given a convex or concave function 5 : M —)■ M, we set 

( 10 ) |V 5 (|(x) := mm{\eg_{x) + {1 - e)g'^{x)\-,6 € [ 0 , 1 ]}, 

where g'_ and g'_^_ denote the left and right derivatives of g (which are well-defined every¬ 
where). In particular, if g is convex 


and if g is concave 


1 

r I4(^)i 

if g'^{x) 

^0 

|Vffl(x) = 

0 

if g'_{x) 

^ 0 ^ g'^{x) 

1 

1 g-{x) 

if g'_{x) 

^0, 

1 

r i<7'-(^)i 

if g'_{x) 

^0 

|Vffl(x) = 

0 

if g'j^{x) 

^ 0 ^ g'_{x) 

1 

1 g+{^) 

if g'j^{x) 

^ 0. 


The following result is one of the key ingredients in the proof of Theorem 1.5. Recall 
the definition of from the introduction. 


Proposition 5.1. Let g. be a probability measure on M. Assume that sup3,gR[17^(x J- 1) — 
U^{x)\ ^ h for some /i > 0. Set K := 2740 and c := l/(10\/2). Then, for all eonvex or 
coneave and l-Lipsehitz funetions g with I ^ c/h, it holds 

(11) Ent^(e®) ^ Kh? [ \Vg{x)\‘^e^^^^g{dx). 

JM. 


The proof of Proposition 5.1 is postponed to the end of this section. 

Proof of Theorem 1.5. As already mentioned above, from [5] we conclude that Item {i) 
is equivalent to Item {ii). In order to make the dependency of the constants explicit in 
the implication (z) {ii), one can use a well-known expansion argument: apply (11) to 

ef and take the limit e —)■ 0, see e.g. [2]. On the other hand, using a similar expansion 
argument, it is easy to prove that Item {in) implies Item {ii) with C = 2D: apply (9) to 
g = ef and take the limit e —)• 0, see e.g. [19, 20]. Hence, we are left with the proof of (i) 
implies {in) which closely follows the Hamilton-Jacobi semi-group approach introduced in 

[ 4 ]. 
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Let /i be a probability measure on the line and assume that Item (i) of Theorem 1.5 
holds. According to Proposition 5.1, for any convex or concave differentiable function g 
which is /-Lipschitz with I ^ c/h := lo, it holds 

( 12 ) ^ Kh'^ [ fi{dx), 

Jr 


with K = 2740 and c = l/(10-v/2). It is easy to check that the latter is equivalent to 
(13) Ent^(e3) ^ j a*{\Vg\)e3 dg,, 

for all convex or concave 5 : M —>■ M with 


a*{v) := sup{riu — a(n)} 

U 


Kh^v^ if lul ^ lo 
+00 if |p| > lo 


• + r / \ I if ^ 2loKh? 

the convex conjugate of a{u) := | M ^ . 

Now, introduce the inf-convolution operators Qt, for t G (0,1], defined by 

Qi/( 3:) := inf {/(y)-hta , x E M, tE(0,l], 


which makes sense for instance for any Lipschitz function / or for any function / bounded 
from below. For simplicity denote by J- the set of functions /: M —?■ M that are /-Lipschitz 
and concave, I ^ lo, or convex and bounded below. Then, Qt satishes the following 
technical properties: 

(а) If / is convex, then Qtf is convex. 

(б) If / is concave and Lipschitz, then Qtf is concave. 

(c) If / E T", then Qtf is /o-Lipschitz. 

(d) If / E T", then the function u{t, x) := Qtf{x) satisfies 
Jacobi equation 

(14) -^u{t,x) + a*{\V~u\){t,x) = 0, VtE(0,l], 

dt-^ 

where is the right time-derivative and |V“u(/,x)| = 

(where as usual [vT]_ := max(—A, 0) denotes the negative part). 

Item (a) is easy to check and is a general fact about infimum convolution of two convex 
functions (/ and a). Item (b) follows from the fact that, after change of variables, Qtfix) = 
infu {f(x — u)+ta{j)} so that Qtf is an infimum of concave functions and is therefore 

also concave. As for Item (c) we observe that x ^ ta is /o-Lipschitz for any y so 

that Qtf is also /o-Lipschitz as an infimum of /o-Lipschitz functions. A proof of Item (c/) 
can be found in [21] or [Ij. We observe that the conclusions of Item (c) — (d) hold in much 
more general settings. 

With these properties and definitions in hand, let / E d-" and (following [4]) define 

F{t) := ^log dgj , 


the following Hamilton- 


Vx E 


limsuo Ht,y)-u{t,x)] 
nmsup^^j, |^_^| 


t E (0,1). 
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The function F is right differentiable at every point t > 0 (thanks to the above technical 
properties of Qt, see e.g. [21] for details) and it holds 

d . 1 

= 


Ir 

1 1 




F /j^ FQtf dfi 
/r dix 


WT,. i (^“5./) e‘ 0 -^ <!,.) 
(Ent^ - KhH^ J | dfi 

J |vgi/|V«‘^d/i-^|v-Qi/|V«‘^dM) ^0, 


where the second equality follows from (14), the first inequality from (13) applied to the 
function g = tQtf (which is convex or concave and t/o-Lipschitz) and the last inequality 
from the fact that for a convex or concave function g, jV^j ^ |V“ 5 | (we recall that jVg'j 
is defined in (10)). 

Thus the function F is non-increasing and satisfies T’(l) ^ limt^o-^(^) = / fd^- In 
other words, 


(15) 


J dfi^el yfeF. 


Now according to Item (in) of Lemma 4.1 one concludes (on the one hand) that fi satisfies 
the transport-entropy inequality T (a): Ta(/i|p) ^ LI(u|/i), for all u G 'Pi(M). 

On the other hand, applying (15) to / = —Qig with g convex and bounded from 

below (so that / is concave and ^-Lipschitz) yields to dfj, ^ 1. Since 

Qi(—Qig) ^ —g we end up with 

jQigd;, J ^ 


for all g convex and bounded from below. According to Item (ii) of Lemma 4.1, this 
implies that n satisfies the transport-entropy inequality T''~(q:): Ta(p|/i) ^ Lf(u|^), for all 
u G 'Pi(M), which completes the proof. □ 


The end of the section is dedicated to the proof of Proposition 5.1. 


Proof fo Proposition 5.1. Let K and c be defined by Lemma 5.2 below. We may deal first 
with convex functions g and divide the proof into three different (sub-)cases: g monotone 
(non-decreasing and then non-increasing), and g arbitrary. 

Assume first that g is convex non-decreasing and LLipschitz with I ^ c/h. Set / = goUfj, 
(recall that is defined in the introduction). Then, since g is non-decreasing, and since 
Ufj,{x — 1) ^ U^{x) — h by assumption, for all x G M, it holds 

f{x) - f{x - 1) ^ g{U^{x)) - g{U^{x) -h) ^Ih^c, Vx G M. 

Therefore, since /r is the image of r under the map C/^, Lemma 5.2 (apply (16) to /) and 
the latter guarantee that 

Ento(e®) = Entr(e-^) ^ K [ (/(x) — /(x — 1))^ e-^^*V((ix) 

Jr 

f {g{U^{x)) - g{U^{x) - h))^ e^dJ,(-))r(dx) 

Jr 

= K f {g{x) — g{x — h))"^ e^^^^fi(dx) ^ Kh^ [ \Vg(x)\^e^^^^pL{dx)., 

«/M 

where the last inequality is due to the fact that g is convex and non-decreasing and 
therefore satisfies 0 ^ g(x) — g{x — h) ^ g'_(x)h = \Vg{x)\h. As a conclusion we proved 
(11) for all convex non-decreasing and LLipschitz functions g with I ^ c/h. 
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Now suppose that g is convex, non-increasing and ^-Lipschitz with I ^ c/h and set 
f{x) = g{Uf^{—x)). The function / is non-decreasing and, since Uf^{—x+ 1) ^ Uf^{—x) + h 
by assumption, satisfies 


f{x) - f{x - 1) = g{Uf,{-x)) - g{U^,{-x + 1)) ^ g{Uf,{-x)) - g{U^,{-x) + h) ^ c. 
Similarly to the previous lines. Lemma 5.2 implies that 

Ent^(e^) = Ent.(e^) ( {g{U^{-x)) - g{U^{-x + 1)))^ 

JR 

f {g{U^{-x)) - g{U^{-x) + h)f 
JR 

= K f {g{x) — g{xh))"^ g.{dx) ^ Kh^ f \Vg{x)\‘^e^^^^g.{dx), 

Jm. «/M 


where we used the symmetry of r and that 0 ^ g{x) — g{x + h) ^ g'_^_{x){—h) = \'Vg{x)\h. 
Therefore we proved (11) for all convex non-increasing and LLipschitz functions g with 
I ^ c/h. 

Einally, consider an arbitrary convex and LLipschitz function g with I ^ c/h and assume 
without loss of generality that g is not monotone. Being convex, there exists some a G M 
such that g restricted to (—oo,o] is non-increasing and g restricted to [a, oo) is non¬ 
decreasing. Subtracting g{a) if necessary, one can further assume that g{a) = 0 since (11) is 
invariant by the change of function g ^ g-\-C (for any constant C). Set gi = 5l(_oo,a] 
g 2 = (7l(a,oo)- The functions gi and g 2 are convex, monotone and LLipschitz. Therefore, 
according to the two previous sub-cases, it holds 

/ a /•+CO 

\Vg{x)\'^gL{dx) and Ent^(e^^) ^ Kh? / \Vg{x)\‘^g,{dx). 

-OO Ja 


So what remains to prove is the following sub-additivity property of the entropy functional 

Ent^(e®i+®2) ^ Ent^(e^i) -hEnt^(e^2)^ 
which, since / ge^dfj = / gie^^dg, + J g 2 e^'^dfj, amounts to proving that 


J e^^dg,log e^^dfj^ J e^^d/rlog e^^dg^ ^ J dglog e^dg 


Setting A = f e^^dg—1, B = f e^^dg—1 and X = f e^dg and observing that = X 

the latter is equivalent to proving that 


{A + 1) log(A + 1) + (B + 1) log(B + 1) ^ X logX , 

which follows from the sub-additivity property of the convex function x e-)• [x -\- 
l)log(x -|- 1) on [0,oo), that satisfies <h(0) = 0. This completes the proof when g is 
convex. 

The case g concave follows the same lines (use (17) instead of (16)). Details are left to 
the reader. □ 


In the proof of Proposition 5.1 we used the following lemma which is a (discrete) variant 
of a result by Bobkov and Ledoux [8] and an entropic counterpart of a result (involving 
the variance) by Bobkov and Gotze (see [5, Lemma 4.8]). 

Lemma 5.2. For all non-decreasing function f \ M —)• M with f (x) — f {x — 1) ^ l/{lt)y/2), 
X G M, it holds 

EntT-(e-^) ^ 2740 [ (/(x) — /(x — 1))'^ e’^dr. 

Jr 


EntT-(e-^) ^ 2740 [ (/(x-|-1) —/(x))^ C'^dr. 
Jr 


(16) 

and 

(17) 


CHARACTERIZATION OF WEAK TRANSPORT-ENTROPY INEQUALITIES 


21 


Proof. Let be the exponential probability measure on M'*': T~^(dx) = e“*l[g We 

shall use the following fact, borrowed from [5, Lemma 4.7] (with a = 0 and /i = 1 so 
that the constant c{a,h) appearing in [5] can be explicitly bounded by 1/200): for all 
/: [—1, oo) —)■ M non-decreasing and satisfying /(O) = 0, it holds 

(18) J fdr'^ ^ 200 J (/(x) - /(x - l)fdT^{x). 


We will first prove (16). Since (16) is invariant by the change of function / —?■ f + C for any 
constant C, we may assume without loss of generality that /(O) = 0. Set f{y) := — /(—y), 
y £ M and observe that / is non-decreasing. Since ulogu ^ u — 1 for all u ^ 0, one has 

EntT-(e'^) ^ J{fe^ — + l)dT = J tf^e^^dt^ dr 

=u°° (i' """+5 r (/o’ 

(19) fe^dT+ + ^J pdT+, 

where the last inequality comes from the fact both / and / are non-negative on M'*'. Now 
suppose that the function / is such that /(y) — /(y — 1) ^ c for all y £ M and some 
c £ (0,1). Our aim is to bound each term in the right hand side of the latter. 

By (18) applied to the function /, one has 

J PdT~^ ^ 200 j {f{y) - f{y - l))^dT+(y) = ^ j (/(?/) “ Pv ~ dr^iv) 

«—' (PiSPiiw&S^] /■'»> - *" - 


where we set = J {f{y)^\{y^\)PdT+ (y) used Jensen’s inequality to guarantee 

that 1// e~Py ^ e-f . By Cauchy-Schwarz’ inequality and using (18) again, we get 


J f{y){f{y) - f{y - '^pdr^iy) ^ (^J (/(y) - f{y - l))^dr+(y)^ (^j pdr^j 


\ 1/2 


^ Vm)c 


- hy-^)?dT^{y). 


It finally follows that 

I pdT+ ^ 2006^^1 ipy) - py - dT+(y) 

= 200e^^ r (/(y + 1) - /{ypP^^^ey dy 

J —OO 

= 2006'^'"“^ f {f{y) - f{y - l)fe-''‘^y~^py dy 

J — OO 

^ 400e'^''+^ [ {f{y) - f{y - l))2e-^(^) dT{y) , 

J —OO 


where in the last line we used that ei'/(e“l^l/2) ^ 2e^ for all y ^ 1. 

Next we deal with the first term in the right hand side of (19). Our aim is to apply 
(18) to y = Observe that, since / is non-decreasing, /(x) ^ /(—I) ^ —c + /(O) = 

—c ^ —1 so that, since x e-)• xe’^/^ is non-increasing on [—2,oo) we are guaranteed that g 
is non-decreasing on [—l,oo) and therefore that we can apply (18) to g. Applying (18) to 
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g = and using the inequality 

0 ^ ^ (6 — a)e^^‘^ + ^(6 — a)e^/^, —2^a^b, 

we get 

B := j f'^e^dT~^ ^ 200 J — f{y — dT^{y) 

^ 400 J (/(y) - f{y - dT^{y) + 100 J f{y){f{y) - f{y - dT^{y) 

^ 400 Jifiy) - f{y - dr+(y) + lOOc^S. 

Therefore, provided c < 1/10 we end up with B ^ 400/(1 — lOOc^) J{f{y) — f{y — 

l))2g/(y) (^ 7 -+(y)_ Hence, plugging the previous two bounds into (19) and choosing c = 

l/\/200. Inequality (16) follows with the better constant 939 in factor of the right hand 
side. 

To obtain (17) from (16), it suffices to observe that, by a simple change of variables 

/* r — \y~^ ^ I 

J ifiy) - f{y - l))^e-^(^) dT{y) = J {f{x + 1) - /(x))^e^(*+^)^^^— dy 

^ e‘^+1 J {fix + 1) - /(x))2e^W dT{x) 

and that 9396^^+^ ^ 2740 for c = l/\/200. This ends the proof. □ 

6. Proof of Theorem 1.2 

In this final section we prove Theorem 1.2. As mentioned in the introduction, we may 
need to decompose the cost 9 into the sum of two costs, one that takes care of the behavior 
near 0 (the cost 6 i) and the other one vanishing in a neighborhood of 0 (the cost 62 ). The 
next theorem deals with the latter. 

Theorem 6.1. Let g G Pi(M) and /3 : M'*' —)• M'*' be a convex cost function such that 
{t G : (3{t) = 0} = [0,to], where to > 0 is some positive constant. The following are 
equivalent: 

{i) There exists a > 0 such that g satisfies the transport-entropy inequality T(/3(a •)). 
{ii) There exists a' > 0 such that g satisfies the weak transport-entropy inequality 
T(/3(a'-)). 

{Hi) There exists 6 > 0 such that max{K~^{b), K (6)) < 00 , where 

1 POO 

K+{b) := sup —-. / 

x^m cxd) Jx 

A-(6) := sup —^-r r ef^^^^^-^^'>g{du), 

x^va /^( 00 , 37 ) J —00 

and m is a median of g. (Here we use the convention 0/0 = 0./ 

{iv) There exists d > 0 such that 

\U^{u) - U^,{v)\ ^ Vn / u G M. 

(Note that /3“^ is well defined on (0,oo)./ 

In particular, 

• {i) {ii) with a' = a, 

• {ii) => {in) with b = a'/2, 

• {ii) => {iv) with d = a' g/j-qiogSl ^ 

• {iv) {ii) with a' = 
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Proof of Theorem 6.1. The equivalence between Items (i), {in) and {iv) is proved in [18, 
Theorem 2.2], with some explicit dependency between the constants. In order to complete 
the proof of Theorem 6.1 we need to show that (z) {ii) =► {in). 

It follows from Jensen’s inequality that Ta(a.)(/z,u) ^ max )(zz|/z); 7g(a.• 
Therefore (z) implies {ii) with a' = a. 

Next we will show that {ii) implies {in). Assume that fi satisfies T(/3(a'-)) for some 
a' > 0. According to Item {iv) of Lemma 4.1, for all convex functions 5 : M —?■ M bounded 
from below, it holds 

[ exp((5/) dn f e~f d/z ^ 1 , where Qf{x) = inf {/(z/) + 2/3(o'|z/ - x|/ 2 )}. 

J J y6R 

Consider the convex function fx which equals to 0 on (— 00 , x] and 00 otherwise. Then 
Qfx{y) = 0 on (—oo,x] and Qfx{y) = 2/3(a'(z/ — x)/2) on (x,oo). Thus, applying the 
inequality above to fx yields 

(fi{-oo,x] + f ) y{-oo,x] ^ 1. 

V J{x,oo) J 

Considering x ^ m yields that K'^{a'/2) ^ 3. One proves similarly that K~{a'I2) ^ 3. 
This shows that {ii) implies {in) with b = a'/2. This achieves the proof of Theorem 
6 . 1 . □ 


We are now in a position to prove Theorem 1.2. 


Proof of Theorem 1.2. Let 9 : M"*" —>■ M"*" be a convex cost function such that 6{f) = on 
[0,fo] for some to > 0. Let us define 9i{t) = on [0,to] and 0i{t) = 2tto — on [to,+ 00 ) 
and 02 (t) = [0(t) — + - Note that 9i and 02 are both convex and that 02 vanishes on 

[0,to] and that max( 0 i, 02 ) ^ 0 ^ 0 i + 02 - 

First assume that n satisfies the weak transport-entropy inequality T(0(a-)) for some 
o > 0 {i.e. Item (z) of Theorem 1.2). Then, since 0 ^ 02 it clearly satisfies T( 02 (a-)). 
According to Theorem 6.1, the mapping sending the exponential measure on ^ satisfies 
the condition: 

(20) supUf^{x + u) - Ufj,{x) ^ ^9f^{u), Vzz > 0, 

with b = QKi, where Ki = to/(80^^(log3)). Since 9f^{u) = 6~^{u + t^) this proves Item 
{ii) of Theorem 1.2 with the announced dependency between the constants. 

Now assume that /z satisfies Item {ii) of Theorem 1.2, or equivalently (20) for some 
6 > 0. Recall that we set, in Theorem 1.5, k := 5480 and c := l/{lt)^/2). Then, observe 
that, plugging zz = 1 into (20) and using Theorem 1.5, one concludes that /z satisfies T(q;) 
with a defined by a{u) = a{u/V2D), with D = k (0~^(1 +1^))^ and 


if juj ^ Ca/k/2 
c^/k\v\ — ^ if juj > c^/kJ2 


z;2 if \v\ ^ a/137/10 

2Vm\v\ - Ifl if izz| > c a/137/10. 


It is not difficult to check that a compares to 0i. More precisely, for all u G M, it holds 

/ ^i377Tb_ \ , A 


a{v) ^ 01 ^max ^ ^ ^ ; 1^ juj^ = 0i ^max 




Therefore /z satisfies T(0i(ai •)), and by monotonicity T(0i(ai •)) with 

, _ max((c\//72)/io; 1 ), _ V 1 max(l,to) 

V^e-\i + ti) AVm59-^{i + ti) " 105 toe-^{i + ti) '-'^^ 












24 


N. GOZLAN, C. ROBERTO, P.-M. SAMSON, Y. SHU, P. TETALI 


On the other hand, according to Theorem 6.1, fi also satisfies T(02(a2-))) with 02 = 
g-i(* 2 +f 2 ) Letting a = min(ai,a 2 ), one concludes that /i satisfies both T(0i(a-)) and 
T(02 (a • ))• Hence, since 6 {at) ^ 6 i{at) + 92 {at) and according to (5), it holds 

^ (a ■) (l^l M) + .) (p|^) 

^ 2H(u|/i), 

and so ^ satisfies T^(20 (a-)). By convexity of 9 and since 0(0) = 0, it holds 2^(2at) ^ 
9{at), and so n satisfies T'''(0((a/2) •)). Finally we observe that 

a b . f max(l,to) lOSto \ ^ b min(l,to) ^ 

2 “ ^ \to9-^{l + tiy '9-\2 + tl)) ^ 2^'9-^{2 + tl) 

so that, by monotonicity, /i satisfies T'*^(0 (k20 • ))• The same reasoning yields the conclusion 
that fj, satisfies T {K 2 b -)), which completes the proof. □ 
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